Cross-Lingual Validation of Multilingual Wordnets
نویسندگان
چکیده
Incorporating Wordnet or its monolingual followers in modern NLP-based systems already represents a general trend motivated by numerous reports showing significant improvements in the overall performances of these systems. Multilingual wordnets, such as EuroWordNet or BalkaNet, represent one step further with great promises in the domain of multilingual processing. The paper describes one possible way to check the quality (correctness and completeness) of the interlingual alignments of several wordnets and pinpoints the possible omissions or alignment errors.
منابع مشابه
Bilingual emb e ddings with random walks over multilingual wordnets
Bilingual word embeddings represent words of two languages in the same space, and allow to transfer knowledge from one language to the other without machine translation. The main approach is to train monolingual embeddings first and then map them using bilingual dictionaries. In this work, we present a novel method to learn bilingual embeddings based on multilingual knowledge bases (KB) such as...
متن کاملEurowordnet: a Multilingual Database of Autonomous and Language-specific Wordnets Connected via an Inter - Lingual-index
This paper describes the multilingual design of the EuroWordNet database. The EuroWordNet database stores wordnets as autonomous language-specific structures that are interconnected via an Inter-Lingual-Index (ILI). In this paper, we discuss the possibilities to create mappings from each wordnet to the central ILI and how the ILI itself can be adapted to provide more overlap across the wordnets...
متن کاملFostering Intercultural Collaboration: A Web Service Architecture for Cross-Fertilization of Distributed Wordnets
Enhancing the development of multilingual lexicons is of foremost importance for intercultural collaboration to take place, as multilingual lexicons are the cornerstone of several multilingual applications. However, the development and maintenance of large-scale, robust multilingual dictionaries is a tantalizing task. In this paper we present a tool, based on a web service architecture, enablin...
متن کاملEuroWordNet: a multilingual database for information retrieval
The aim of the EuroWordNet-project is the development of a database with wordnets for English, Spanish, Dutch and Italian, similar to the Princeton WordNet1.5, which contains basic semantic relations between words in English. The Dutch, Italian and Spanish wordnets will be linked to the WordNet1.5 using equivalence relations. The resulting multilingual database can directly be used in (multi-li...
متن کاملLinking and Validating Nordic and Baltic Wordnets - A Multilingual Action in META-NORD
This project report describes a multilingual wordnet initiative embarked in the META-NORD project and concerned with the validation and pilot linking between Nordic and Baltic wordnets. The builders of these wordnets have applied very different compilation strategies: The Danish, Icelandic and Swedish wordnets are being developed via monolingual dictionaries and corpora and subsequently linked ...
متن کامل